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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Lai, Preeti 

Corley, Neil C. 

(ii) TITLE OF THE INVENTION: HUMAN SHORT CHAIN DEHYDROGENASE 

(iii) NUMBER OF SEQUENCES: 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Dr. 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE / DOCKET NUMBER: PF-0475 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-855-0555 

(B) TELEFAX: 650-845-4166 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSNOT01 

(B) CLONE: 356351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



o 

Cj 
m 

■ , s 

n i 

■■: 

'r'r- 



1 



PF-0475-2 DIV 



Met Ala Ala Pro Met Asn Gly Gin Val Cys Val Val Thr Gly Ala Ser 

1 5 10 15 

Arg Gly lie Gly Arg Gly lie Ala Leu Gin Leu Cys Lys Ala Gly Ala 

20 25 30 

Thr Val Tyr lie Thr Gly Arg His Leu Asp Thr Leu Arg Val Val Ala 

35 40 45 

Gin Glu Ala Gin Ser Leu Gly Gly Gin Cys Val Pro Val Val Cys Asp 

50 55 60 

Ser Ser Gin Glu Ser Glu Val Arg Thr Leu Phe Glu Gin Val Asp Arg 
65 70 75 80 

Glu Gin Gin Gly Arg Leu Asp Val Leu Val Asn Asn Ala Tyr Ala Gly 

85 90 95 

Val Gin Thr lie Leu Asn Thr Arg Asn Lys Ala Phe Trp Glu Thr Pro 

100 105 110 

Ala Ser Met Trp Asp Asp lie Asn Asn Val Gly Leu Arg Gly His Tyr 

115 120 125 

Phe Cys Ser Val Tyr Gly Ala Arg Leu Met Val Pro Ala Gly Gin Gly 

130 135 140 

Leu lie Val Val lie Ser Ser Pro Gly Ser Leu Gin Tyr Met Phe Asn 
145 150 155 160 

Val Pro Tyr Gly Val Gly Lys Ala Ala Cys Asp Lys Leu Ala Ala Asp 

165 170 175 

Cys Ala His Glu Leu Arg Arg His Gly Val Ser Cys Val Ser Leu Trp 

180 185 190 

Pro Gly lie Val Gin Thr Glu Leu Leu Lys Glu His Met Ala Lys Glu 

195 200 205 

Glu Val Leu Gin Asp Pro Val Leu Lys Gin Phe Lys Ser Ala Phe Ser 

210 215 220 

Ser Ala Glu Thr Thr Glu Leu Ser Gly Lys Cys Val Val Ala Leu Ala 
225 230 235 240 

Thr Asp Pro Asn He Leu Ser Leu Ser Gly Lys Val Leu Pro Ser Cys 

245 250 255 

Asp Leu Ala Arg Arg Tyr Gly Leu Arg Asp Val Asp Gly Arg Pro Val 
■il 260 265 270 

Gin Asp Tyr Leu Ser Leu Ser Ser Val Leu Ser His Val Ser Gly Leu 

275 280 285 

Gly Trp Leu Ala Ser Tyr Leu Pro Ser Phe Leu Arg Val Pro Lys Trp 

290 295 300 

He He Ala Leu Tyr Thr Ser Lys Phe 
305 310 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: PROSNOT01 

(B) CLONE: 356351 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

CTAACTTTGG CCTGGGACTC TGCCCCTCTA CCTCAGCACA GAATCGCCCC GGGTCCTACT 
ACAGAATCAA TCCTTGAACA CTGCCTCCAC GTCGCCGGCT CAATCTGGGC GAGAACCCAG 
ACTTCCACCG CAGCCCCGCA ATCTGCAGAC CTCAGCGGCA GCGCAGGTGG CAGACCTGCC 
TCCTTTGCCT GTGAGTCATG GCAGCTCCCA TGAATGGCCA AGTGTGTGTG GTGACTGGTG 
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CCTCCAGGGG TATTGGC CGT GGCATTGCCT TGCAGCTCTG CAAAGCAGGC GCCACAGTTT 3 00 

ACATCACTGG CCGCCATCTG GACACCCTTC GCGTTGTTGC TCAGGAGGCA CAATCCCTCG 3 60 

GGGGCCAATG TGTGCCTGTG GTGTGCGATT CAAGCCAGGA GAGTGAAGTG CGAACGCTGT 420 

TTGAGCAAGT GGATCGGGAA CAGCAAGGGC GTCTAGATGT GCTGGTCAAC AATGCTTATG 480 

CAGGGGTCCA GACGATCCTG AACACCAGGA ATAAGGCATT CTGGGAAACC CCTGCCTCCA 540 

TGT GGGATGA TATCAACAAC GTCGGACTCA GAGGCCACTA CTTTTGCTCA GTGTATGGGG 600 

CACGGCTGAT GGTACCAGCT GGCCAGGGGC TCATCGTGGT CATCTCCTCC CCAGGAAGCC 660 

TGCAGTATAT GTTCAATGTC CCCTATGGTG TGGGCAAAGC TGCGTGTGAC AAGCTGGCTG 72 0 

CTGACTGTGC CCACGAGCTG CGGCGCCATG GGGTCAGCTG TGTGTCTCTG TGGCCGGGGA 780 

TTGTGCAGAC AGAACTGCTG AAGGAGCATA TGGCAAAGGA GGAGGTCCTG CAGGATCCTG 840 

TGTTGAAGCA GTTCAAATCA GCCTTCTCAT CTGCAGAAAC CACAGAATTG AGTGGCAAAT 900 

GTGTGGTGGC TTTGGCAACA GATCCCAATA TCCTGAGCCT GAGTGGTAAG GTGCTGCCAT 9 60 

CCTGTGACCT TGCTCGACGC TATGGCCTTC GGGATGTGGA CGGCCGCCCC GTCCAAGACT 1020 

ATTTGTCTTT GAGCTCTGTT CTCTCACACG TGTCCGGCCT GGGCTGGCTG GCCTCCTACC 1080 

TGCCCTCCTT CCTCCGTGTG CCCAAGTGGA TTATTGCCCT CTACACTAGC AAGTTC TAAC 114 0 

CCTCCTGGTC TGACACTACG TCTCTGCTTG TCTTCTCATT TGGACTTGGT GGTTCGTCCT 12 0 0 

GTCTCAGTGA AACAGCAGCC TTTC TTGTTT ACCCATACCC TTGATATGAA GAGAAGCCCT 1260 

CTGCTGTGTG TCCGTGGTGA GTTCTGGGGT GCGCCTAGGT CCCTTCTTTG TGCCTTGGTT 132 0 

TTCCTTGTCC TTCTTTTTAC TTTTTGCCTT AGTATTGAAA AATGCTCTTG GAGCTAATAA 1380 

AAGTCTA 13 87 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 2315796 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Gly Val He Leu Gin Asp Gin Val Ala Leu Val Thr Gly Ala Ser 

15 10 15 

Arg Gly He Gly Arg Gly He Ala Leu Gin Leu Gly Glu Ala Gly Ala 

20 25 30 

Thr Val Tyr lie Thr Gly Arg Arg Pro Glu Leu Ser Asp Asn Phe Arg 

35 40 45 

Leu Gly Leu Pro Ser Leu Asp Tyr Val Ala Lys Glu He Thr Ser Arg 

50 55 60 

Gly Gly Lys Gly He Ala Leu Tyr Val Asp His Ser Asn Met Thr Glu 
65 70 75 80 

Val Lys Phe Leu Phe Glu Lys He Lys Glu Asp Glu Glu Gly Lys Leu 

85 90 95 

Asp He Leu Val Asn Asn Val Tyr Asn Ser Leu Gly Lys Ala Thr Glu 

100 105 110 

Met He Gly Lys Thr Phe Phe Asp Gin Asp Pro Ser Phe Trp Asp Asp 

115 120 125 

He Asn Gly Val Gly Leu Arg Asn His Tyr Tyr Cys Ser Val Tyr Ala 

130 135 140 

Ala Arg Met Met Val Glu Arg Arg Lys Gly Leu He Val Asn Val Gly 
145 150 155 160 

Ser Leu Gly Gly Leu Lys Tyr Val Phe Asn Val Ala Tyr Gly Ala Gly 

165 170 175 

Lys Glu Ala Leu Ala Arg Met Ser Thr Asp Met Ala Val Glu Leu Asn 
180 185 190 
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Pro Tyr Asn Val Cys 
195 

Glu Thr Ala Asn Arg 
210 

Glu Asn Pro Glu Leu 
225 

Thr Gly Lys Ala Leu 
245 

Lys Ser Gly Lys Thr 
260 

Phe Ser Asp Lys His 
275 

lie Arg Thr lie Leu 
290 

lie Pro Pro Gin lie 
305 

Asn Arg Phe 



Val Val Thr Leu He Pro 
200 

Thr He He Asp Asp Ala 
215 

Glu Glu Phe He Lys Gly 
230 235 
Ala Arg Leu Ala Met Asp 
250 

Leu Phe Thr Glu Asp Leu 
265 

Gly Ala Gly Met Glu Pro 
280 

Gly Thr Met Gly Lys Glu 
295 

Lys Leu Pro Lys Trp Val 
310 315 



Gly Pro Val Lys Thr 
205 

Tyr Lys Met He Lys 
220 

Glu Ser Thr Glu Tyr 
240 

Pro Gly Lys Leu Lys 
255 

Ala Gin Lys Tyr Asp 
270 

Gin Asn He Arg Ser 
285 

Glu Val Ala Lys Tyr 
300 

He Trp Gin Ser Val 
320 



